Analysis and Description of ABC Submission to NIST SRE 2016
نویسندگان
چکیده
We present a condensed description and analysis of the joint submission for NIST SRE 2016, by Agnitio, BUT and CRIM (ABC). We concentrate on challenges that arose during development and we analyze the results obtained on the evaluation data and on our development sets. We show that testing on mismatched, non-English and short duration data introduced in NIST SRE 2016 is a difficult problem for current state-of-theart systems. Testing on this data brought back the issue of score normalization and it also revealed that the bottleneck features (BN), which are superior when used for telephone English, are lacking in performance against the standard acoustic features like Mel Frequency Cepstral Coefficients (MFCCs). We offer ABC’s insights, findings and suggestions for building a robust system suitable for mismatched, non-English and relatively noisy data such as those in NIST SRE 2016.
منابع مشابه
SUT System Description for NIST SRE 2016
This paper describes the submission to fixed condition of NIST SRE 2016 by Sharif University of Technology (SUT) team. We provide a full description of the systems that were included in our submission. We start with an overview of the datasets that were used for training and development. It is followed by describing front-ends which contain different VAD and feature types. UBM and i-vector extr...
متن کاملSUT Submission for NIST 2016 Speaker Recognition Evaluation: Description and Analysis
In this paper, the most recent Sharif University of Technology (SUT) speaker recognition system developed for NIST 2016 Speaker Recognition Evaluation (SRE) is described. The major challenge in this evaluation is the language mismatch between training and evaluation data. The submission is related to the fixed condition of NIST SRE 2016 and features a full description of the database and the sy...
متن کاملIdiap Submission to the Nist Sre 2016 Speaker Recognition Evaluation
Idiap has made one submission to the fixed condition of the NIST SRE 2016. It consists of two gender-dependent ivector systems that use posteriors from a Universal Background Model and a Deep Neural Network, respectively, whose scores have been fused via logistic regression. Both systems use Linear Discriminant Analysis (LDA) for i-vector post-processing and Probabilistic LDA for inference. The...
متن کاملThe I3a speaker recognition system for NIST SRE12: post-evaluation analysis
The I3A submission for the recent NIST 2012 speaker recognition evaluation (SRE) was based on the i-vector approach with a multi-channel PLDA classifier. This PLDA is modified so that, for each i-vector, the between-class covariance depends on the type of channel where the segment was recorded (telephone,interviews,clean, noisy, etc). In this paper, we present the description of our submission ...
متن کاملSpeaker Verification on Summed-Channel Conditions with Confidence Measures Verificación de locutor en condiciones de canal sumado con medidas de confianza
This paper addresses the problem of speaker verification in two speaker conversations, proposing a set of confidence measures to assess the quality of a given speaker segmentation. We study how these measures can be used to estimate the performance of a state-of-the-art speaker verification system, the I3A submission for the core-summed condition in the NIST SRE 2010. We present a Factor Analys...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017